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SYSTEM OF ANALYZING COMPLEX MIXTURES OF BIOLOGICAL AND 
OTHER FLUIDS TO IDENTIFY BIOLOGICAL STATE INFORMATION 

CROSS REFERENCE TO RELATED APPLICATIONS 
[0001] This application claims priority to U.S. Provisional Patent Application Serial 
Number 60/473,272 filed May 22, 2003. Such application is incorporated by reference for 
all purposes. 

BACKGROUND OF THE INVENTION 
[0002] The present inventions provide a business system and method for pharmaceutical, 
diagnostic, and biological research as well as applications of such research. 
[0003] A common aspect of all life on earth is the use of polypeptides as functional 
building blocks and the encryption of the instructions for the building blocks in the 
blueprint of nucleic acids (DNA, RNA). What distinguishes between living entities lies in 
the instructions encoded in the nucleic acids of the genome and the way the genome 
manifests itself in response to the environment as proteins. The complement of proteins, 
protein fragments, and peptides present at any specific moment in time defines who and 
what we are at that moment, as well as our state of health or disease. 
[0004] One of the greatest challenges facing biomedical research and medicine is the 
limited ability to distinguish between specific biological states. This is reflected in the 
limited ability to detect the earliest stages of disease, anticipate the path any apparent 
disease will take in one patient versus another, predict the likelihood of response for any 
individual to a particular treatment, and preempt the possible adverse affects of treatments 
on a particular individual. 

[0005] New technologies and strategies are needed to inform medical care and improve 
the repertoire of medical tools, as well as business methods to utilize such technologies 
and strategies. 



BRIEF SUMMARY OF THE INVENTION 
[0006] According to one embodiment of the invention, a business method is provided that 
includes the steps of collecting more than 10 case samples representing a clinical 
phenotypic state and more than 10 control samples representing patients without said 
clinical phenotypic state; using a mass spectrometry platform system to identify patterns 
of polypeptides in said case samples and in said control samples without regard to the 
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specific identity of at least some of said proteins; identifying representative patterns of the 
phenotypic state; and marketing diagnostic products using said representative patterns. 
Such patterns contain preferably more than 15 polypeptides that are represented on output 
of said mass spectrometer, but the identity of at least some of said more than 15 
polypeptides is not known. 

BRIEF DESCRIPTION OF THE FIGURES 
[0007] Figure 1 is an overall flowchart illustrating the operation of one embodiment of the 
business method. 

[0008] Figure 2 is a diagram illustrating preferred aspects of the invention herein. 
[0009] Figure 3 illustrates one mass spectrometer that may be used herein. 

DETAILED DESCRIPTION OF THE INVENTION 
[0010] The business methods herein utilize and apply a system that is able to differentiate 
biological states with reliability, reproducibility, and sensitivity. In one embodiment, the 
system relies on an integrated, reproducible, sample preparation, separation, and 
electrospray ionization system in a micro fluidics format, with high sensitivity mass 
spectrometry and informatics. This system will serve as the foundation for the discovery 
of patterns of polypeptides that reflect and differentiate biological states specific for 
various states of health and disease. Polypeptides includes, for purposes herein, e.g. 
proteins, peptides, and/or protein fragments. These patterns of polypeptides that reflect 
and differentiate biological states will then be utilized in clinically useful formats and in 
research contexts. Clinical applications will include detection of disease; distinguishing 
disease states to inform prognosis, selection of therapy, and the prediction of therapeutic 
response; disease staging; identification of disease processes; prediction of efficacy; 
prediction of adverse response; monitoring of therapy associated efficacy and toxicity; and 
detection of recurrence. 

[0011] Figure 1 illustrates the overall process of the business methods disclosed herein. 
At step 101 the involved business (alone or with collaborators) collects a representative 
sample set of case samples and control samples. The case samples will be those wherein a 
patient exhibits a particular disease state or other phenotype. For example, the case 
samples may be those where a patient exhibits a response to a drug. Conversely, the 
control samples will be collected from patients that do not exhibit the phenotype under 
study, such as those that do not have the disease or response to a drug. Preferably more 
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than 10 case and 10 control samples are collected for use. Preferably more than 20 case 
and 20 control samples, preferably more than 50 case and 50 control samples, preferably 
more than 100 case and 100 control samples, and most preferably more than 500 case and 
500 control samples are collected. 

[0012] At step 103 the case and control samples are assayed to identify patterns of 
markers that are present in the case and control samples. In preferred embodiments the 
markers are polypeptides such as proteins, although they may also include small 
molecules, nucleic acids, polysaccharides, metabolites, lipids, or the like. Preferably, the 
patterns are obtained without advance selection or screening of the particular polypeptides 
involved. In some embodiments the patterns are obtained without identification of some 
or all of the markers that are shown in the pattern. Three conceptual patterns are 
illustrated for cases at 104a and controls at 104b. As shown, the patterns are greatly 
simplified from those that will be actually observed. 

[0013] Preferably the assay identifies the presence of more than 100 polypeptides, 
preferably more than 200 polypeptides, more preferably more than 500 polypeptides, more 
preferably more than 1000 polypeptides, and more preferably more than 2000 
polypeptides. While the identity of some of the polypeptides will be known from prior 
studies, it is not necessary to specifically identify all of the polypeptides indicated by the 
assay. Instead, the business takes advantage of the presence of (or absence of) a pattern of 
many polypeptides repeatedly found to be in the cases in a pattern distinct from the 
controls. In various embodiments a number of polypeptides are represented in the pattern, 
but the identity of some of these polypeptides is not known. For example, more than 15 
polypeptides can be represented, more than 30 polypeptides can be represented, more than 
50 polypeptides can be represented, more than 100 polypeptides can be represented, and 
more than 1000 polypeptides can be represented. 

[0014] In preferred embodiments, the business relies on a mass spectrometry system to 
perform the assays. Preferably such systems allow for the capture and measure of most or 
all of the instances of a polypeptide in a sample that is introduced in the mass spectrometer 
for analysis. Using such systems it is preferable that one can observe those polypeptides 
with high information-content but that are only present at low concentrations, such as 
those "leaked" from diseased tissue. 

[0015] In some embodiments, an early assay, such as the first assay, is followed by a later 
assay. The early assay will be normally be used in initial identification of the 
polypeptides that identify or separate cases from controls. The later assay is adjusted 
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according to parameters that can focus diagnostics or evaluation of regions of interest, 
such as regions of high variability, i.e. those regions or markers where there are significant 
differences between case samples and control samples. The parameters can be determined 
by, for example, an early assay which may identify the regions of interest, which may be 
on one technology platform, and a later assay on the same or a different platform. 
[0016] At step 105 bioinformatics systems are utilized to identify the differences in the 
polypeptide patterns in the case and control samples. Such techniques may be proceeded 
by various data cleanup steps. Patterns will be composed of the relative representation of 
numerous polypeptides or other biological entities, the collective profile of which will be 
more important than the presence or absence of any specific entities. By identifying 
patterns in blood or other patient samples, the methods will not only provide the window 
to the presence of disease and other pathology in some embodiments, but also to the 
ongoing response to the disease or pathologic condition in other embodiments. In a high 
throughput mode, data from a first sample are evaluated in a bio-informatics system at the 
same time another sample is being processed in, for example, a mass spectrometry system. 
[0017] As shown in the three simplified patterns for "cases" 104a, peaks 106a and 106b 
tend to be observed in three "case" samples at higher levels. Conversely, less or no signal 
is observed at peak 106c in the three case samples. By contrast, in the control samples 
104b, peaks 106a and 106c tend to be observed while peak 106b tends to be at low levels. 
Of course, the patterns shown in Fig. 1 are greatly simplified, and there will be much more 
complex patterns in actual practice, such as tens, hundreds, or thousands of such peaks. 
In the particular example illustrated in Fig. 1, peak 106a is not informative, while peak 
106b tends to occur in cases, and peak 106c tends to occur in controls. Automated 
systems will generally be applied in the identification of the patterns that distinguish cases 
and controls. The measurement of patterns of multiple signals will enable the 
identification of subtle differences in biological state and make the identification of that 
state more robust and less subject to biological noise. 

[0018] At step 107 the business uses the patterns of polypeptides present in the sample to 
identify the disease state of a patient sample in, for example, a diagnostic setting. Samples 
used in both the steps 101 and 107 will, in preferred embodiments be serum samples, 
although tissue samples from a variety of sources will be used in alternative embodiments. 
Preferably, though not necessarily, the system used in the diagnostic application is based 
upon the same technology platform as the platform used to identify the patterns in the first 
instance. For example, if the platform used to identify the patterns in the first instance is a 
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time of flight mass spectrometer, it is preferred that the diagnostic applications of the 
patterns are run on a time of flight mass spectrometer. 

[0019] The marketing of the products can take a number of forms. For example, it may be 
that the developer actually markets the instruments and assays into the diagnostic research 
market. In alternative embodiments, the developer of the patterns will partner with, for 
example, a large diagnostic company that will market those products made by the 
developer, alone or in combination with their own products. In alternative embodiments, 
the developer of the patterns licenses the intellectual property in the patterns to a third 
party and derives revenue from licensing income arising from the pattern information. 
[0020] The business method herein can obtain revenue by various means, which may 
vary over time. Such sources may include direct sale revenue of products, upfront license 
fees, research payment fees, milestone payments (such as upon achievement of sales goals 
or regulatory filings), database subscription fees, and downstream royalties and from 
various sources including government agencies, academic institution and universities, 
biotechnology and pharmaceutical companies, insurance companies, and health care 
providers. 

[0021] Often, diagnostic services hereunder will be offered by clinical reference 
laboratories or by way of the sale of diagnostic kits. Clinical reference laboratories 
generally process large number of patient samples on behalf of a number of care givers 
and/or pharmaceutical companies. Such reference laboratories in the United States are 
normally qualified under CLIA and/or CAP regulations. Of course, other methods may 
also be used for marketing and sales such as direct sales of kits such as FDA or equivalent 
approved products. In some cases the developer of the pattern content will license the 
intellectual property and/or sell kits and/or reagents to a reference laboratory that will 
combine them with other reagents and/or instruments in providing a service. 
[0022] In the short term, the business methods disclosed generate revenue by, for 
example, providing application specific research or diagnostic services to third parties to 
discover and/or market the patterns. Examples of third-parties include customers who 
purchase diagnostic or research products (or services for discovery of patterns), licensees 
who license rights to pattern recognition databases, and partners who provide samples in 
exchange for downstream royalty rights and/or up front payments from pattern 
recognition. Depending on the fee, diagnostic services may be provided on an exclusive 
or non-exclusive basis. 
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[0023] Revenue can also be generated by entering into exclusive and/or non-exclusive 
contracts to provide polypeptide profiling of patients and populations. For example, a 
company entering clinical trials may wish to stratify a patient population according to, for 
example, drug regimen, effective dosage, or otherwise. Stratifying a patient population 
may increase the efficacy of clinical trial (by removing, for example, non responders), thus 
allowing the company to enter into the market sooner or allow a drug to be marketed with 
a diagnostic test that identifies patients that may have an adverse response or be non- 
responsive. In addition, insurance companies may wish to obtain a polypeptide profile of 
a potential insured and/or to determine if, for example a drug or treatment will be effective 
for a patient. 

[0024] In the long term, revenue may be generated by alternative methods. For example, 
revenue can be generated by entering into exclusive and/or non-exclusive drug discovery 
contracts with drug companies (e.g., biotechnology companies and pharmaceutical 
companies). Such contracts can provide for downstream royalties on a drug based on the 
identification or verification of drug targets (e.g., a particular protein or set of polypeptides 
associated with a phenotypic state of interest), or on the identification of a subpopulation 
in which such drug should be utilized. Alternatively, revenue may come from a licensee 
fee on a diagnostic itself. The diagnostic services, patterns, and tools herein can further be 
provided to a pharmaceutical company in exchange for milestone payments or 
downstream royalties. Revenue may also be generated from the sale of disposable fluidics 
devices, disposable microfluidics devices, or other assay reagents or devices in for 
example the research market, diagnostic market, or in clinical reference laboratories. 
Revenue may also be generated from licensing of applications-specific software or 
databases. Revenue may, still further, be generated based on royalties from technology 
platform providers who may license some or all of the proprietary technology. For 
example, a mass-spectrometer platform provider may license the right to further distribute 
software and computer tools and/or polypeptide patterns. 

[0025] In preferred embodiments, the TOF device utilized herein is coupled to a 
microfluidic separations device. The sample preparation techniques preferably 
concentrate the polypeptides the mass spectrometer is best able to detect and/or are which 
are most informative, and deplete the ones that are more difficult to detect and/or are less 
informative. In most preferred embodiments the microfluidic separations device is a 
disposable device that is readily attached to and removed from the TOF mass 
spectrometer, and sold as a disposable, thereby providing a recurring revenue stream to the 
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involved business. Preferably, a mass spectrometer is utilized that will accept a continuous 
sample stream for analysis and provide high sensitivity throughout the detection process. 
[0026] Sample preparation will, in some embodiments, include the removal of high 
abundance polypeptides, removal of polypeptides expected to be in abundance in all 
samples, addition of preservatives and calibrants, and desalting. These steps will allow 
sensitive measurement of concentrations of information-rich polypeptides, such as those 
that have leaked from tissue, as compared to polypeptides that would carry little 
information, such as those highly abundant and native to serum. Prepared samples will 
then be separated using fast molecular separations methods with high peak capacities. An 
electrospray ionization (ESI) interface may be integrated on the microfluidics chip, which 
will ionize and spray the prepared and separated serum directly into a mass spectrometer. 
The microfluidics-based separations preferably provide the polypeptide mixtures at flow 
rates and at complexity levels that are matched to the mass spectrometer's optimal 
performance regions. The mass spectrometer's sensitivity is preferably optimized to 
detect the species most likely to differentiate biological states. Preferably, the reagents 
necessary for performing these steps are provided in or along with the microfluidics 
device, thereby allowing for additional recurring revenue to the involved business. 
[0027] The system used for removal of high abundance polypeptides may be based on, for 
example, the use of high affinity reagents for removal of the polypeptides, the use of high 
molecular weight filters, ultracentrifugation, precipitation, and/or electrodialysis. 
Polypeptides that will often be removed will include, for example, those involved in 
normal metabolism, and a wide variety of other indications not of relevance to a particular 
assay. 

[0028] Figure 2 illustrates additional aspects of an exemplary system platform used 
herein. The invention involves an integrated system to a) discover and b) assay patterns 
of polypeptides that reflect and differentiate biological and clinical states of organisms, 
including patients, in biological materials including but not limited to body fluids. 
Biological and clinical states include but are not limited to states of development; age; 
health; pathology; disease detection, process, or staging; infection; toxicity; or response to 
chemical, environmental, or drug factors (such as drug response phenotyping, drug 
toxicity phenotyping, or drug effectiveness phenotyping). Biological fluids 201 include 
but are not limited to serum, plasma, whole blood, nipple aspirate, pancreatic fluid, 
trabecular fluid, lung lavage, urine, cerebrospinal fluid, saliva, sweat, pericrevicular fluid, 
and tears. 
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[0029] The system provides for the integration of fast molecular separations and 
electrospray ionization system 204 on a micro fluidics platform 203. The system provides 
processed samples to a high sensitivity time of flight mass spectrometer 205. Signal 
processing system and pattern extraction and recognition tools 205 incorporate domain 
knowledge to extract information from polypeptide patterns and classify the patterns to 
provide a classification 209. 

[0030] The microfluidics device(s) 203 may be formed in plastic by means of etching, 
machining, cutting, molding, casting or embossing. The microfluidics device(s) may be 
made from glass or silicon by means of etching, machining, or cutting. The device may be 
formed by polymerization on a form or other mold. The molecular separations unit or the 
integrated fast molecular separations/electrospray ionization unit may provide additional 
sample preparation steps, including sample loading, sample concentration, removal of salts 
and other compounds that may interfere with electrospray ionization, removal of highly 
abundant species, proteolytic or chemical cleavage of components within the biological 
material, and/or aliquoting in to storage containers. 

[0031] The device(s) for separations and electrospray may be either single use for a single 
sample, multi-use for a single sample at a time with serial loading, single use with parallel 
multiple sample processing, multi-use with parallel multiple sample processing or a 
combination. Separations processes may include isoelectric focusing, electrophoresis, 
chromatography, or electrochromatography. The separations device may include 
collection areas or entities for some or all of the purified or partially purified fractions. 
[0032] Figure 3 illustrates a mass spectrometer system 205 in greater detail in one specific 
embodiment of the invention. In Figure 3, an orthogonal multiplex time-of-flight mass 
spectrometer includes an analyzer that receives an ion beam from an electrospray 
ionization (ESI) source 301 such as disclosed in U. S. Ser. No. 10/395,023. By 
"multiplex" in this context it is intended to mean a system that processes multiple ion 
packets at the same time. The ion beam is initially introduced into analyzer 303 along an 
axis 305, and the analyzer generally accumulates differing size packets of ions of the beam 
and accelerates the packets of ions laterally along a flight path 307. The pulses or packets 
of ions are spaced in time and along the flight path by different accumulation periods, and 
the speed of travel of the ions along flight path 307 varies with a mass-to-charge ratio 
(m/z) such that the ions of sequential pulses, and often the ions of three or more pulses, 
will arrive intermingled at one time at a detector 309. 
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[0033] In addition to analyzer 303, the system includes a driver 31 1 to intermittently 
energize lateral acceleration electrodes of analyzer 303. Driver 311 modulates or encodes 
the beam with the pseudorandom sequence by reference to a clock signal supplied from a 
multichannel scaler 313. Driver 31 1 also supplies a trigger signal to the multichannel 
scaler 313 to signal the start of a sequence. An output signal from detector 309 is 
amplified by an amplifier 315 and is counted by multichannel scaler 313. 
[0034] The pseudorandom sequence applied by driver 31 1 will typically provide for time 
periods which may each be defined as integer multiples of a unit accumulation time. To 
facilitate reconstruction of a spectrum from the signal generated by detector 309, 
multichannel scaler 3 1 3 may count the amplified signal from amplifier 315 into time bins 
which represent integral fractions of this unit time. These counts can then be sent to a 
computer 317 for reconstruction of a particular spectra and characterization of the sample 
material introduced into the system via ESI source 301. 

[0035] Computer 317 may also control a variety of additional components of system 205, 
with a wide variety of alternative data processing being possible. The structure and use of 
driver 311, multichannel scaler 313, amplifier 315 and computer 317 may in some 
embodiments be those such as shown in U.S. patent 6,300,626 issued to Brock et al. and 
entitled "Time-of-Flight Mass Spectrometer and Ion Analysis" on October 9, 2001, which 
is fully incorporated by reference along with all other references cited in this application. 

Sample Collection 

[0036] Case samples are obtained from individuals with a particular phenotypic state of 
interest. Examples of phenotypic states include, phenotypes resulting from an altered 
environment, drug treatment, genetic manipulations or mutations, injury, change in diet, 
aging, or any other characteristic(s) of a single organism or a class or subclass of 
organisms. In a preferred embodiment, a phenotypic state of interest is a clinically 
diagnosed disease state. Such disease states include, for example, cancer, cardiovascular 
disease, inflammatory disease, and infectious disease. Control samples are obtained from 
individuals who do not exhibit the phenotypic state of interest or disease state (e.g., an 
individual who is not affected by a disease or who does not experience negative side 
effects in response to a given drug). 

[0037] Cancer phenotypes are studied in some aspects of the business method. Examples 
of cancer include, but are not limited to, breast cancer, skin cancer, bone cancer, prostate 
cancer, liver cancer, lung cancer, brain cancer, cancer of the larynx, gallbladder, pancreas, 
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rectum, parathyroid, thyroid, adrenal, neural tissue, head and neck, colon, stomach, 
bronchi, kidneys, basal cell carcinoma, squamous cell carcinoma of both ulcerating and 
papillary type, metastatic skin carcinoma, osteo sarcoma, Ewing's sarcoma, veticulum cell 
sarcoma, myeloma, giant cell tumor, small-cell lung tumor, gallstones, islet cell tumor, 
primary brain tumor, acute and chronic lymphocytic and granulocytic tumors, hairy-cell 
tumor, adenoma, hyperplasia, medullary carcinoma, pheochromocytoma, mucosal 
neuronms, intestinal ganglioneuromas, hyperplastic corneal nerve tumor, marfanoid 
habitus tumor, Wilm's tumor, seminoma, ovarian tumor, leiomyomater tumor, cervical 
dysplasia and in situ carcinoma, neuroblastoma, retinoblastoma, soft tissue sarcoma, 
malignant carcinoid, topical skin lesion, mycosis fungoide, rhabdomyosarcoma, Kaposi's 
sarcoma, osteogenic and other sarcoma, malignant hypercalcemia, renal cell tumor, 
polycythemia vera, adenocarcinoma, glioblastoma multiforma, leukemias, lymphomas, 
malignant melanomas, epidermoid carcinomas, and other carcinomas and sarcomas. 
[0038] Cardivascular disease may be studied in other applications of the invention. 
Examples of cardiovascular disease include, but are not limited to, congestive heart 
failure, high blood pressure, arrhythmias, cholesterol, Wolff-Parkinson- White Syndrome, 
long QT syndrome, angina pectoris, tachycardia, bradycardia, atrial fibrillation, ventricular 
fibrillation, congestive heart failure, myocardial ischemia, myocardial infarction, cardiac 
tamponade, myocarditis, pericarditis, arrhythmogenic right ventricular dysplasia, 
hypertrophic cardiomyopathy, Williams syndrome, heart valve diseases, endocarditis, 
bacterial, pulmonary atresia, aortic valve stenosis, Raynaud's disease, Raynaud's disease, 
cholesterol embolism, Wallenberg syndrome, Hippel-Lindau disease, and telangiectasis. 
[0039] Inflammatory disease may be studied in other applications of the business method. 
Examples of inflammatory disease include, but are not limited to, rheumatoid, arthritis, 
non-specific arthritis, inflammatory disease of the larynx, inflammatory bowel disorder, 
pelvic inflammatory disease, inflammatory disease of the central nervous system, temporal 
arteritis, polymyalgia rheumatica, ankylosing spondylitis, polyarteritis nodosa, Reiter's 
syndrome, scleroderma, systemis lupus and erythematosus. 

[0040] Infectious disease may be studied in still further aspects of the business method. 
Examples of infectious disease include, but are not limited to, AIDS, hepatitis C, SARS, 
tuberculosis, sexually transmitted diseases, leprosay, lyme disease, malaria, measles, 
meningitis, mononucleosis, whooping cough, yellow fever, tetanus, arboviral encephalitis, 
and other bacterial, viral, fungal or helminthic diseases. 
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[0041] Samples may be collected from a variety of sources in a given patient depending 
on the application of the business. In some embodiments samples are collected on the 
account of the company itself, while in other examples they are collected in collaboration 
with an academic collaborator or pharmaceutical collaborator that, for example, is 
collecting samples in a clinical trial. Samples collected are preferably bodily fluids such 
as blood, serum, sputum, including, saliva, plasma, nipple aspirants, synovial fluids, 
cerebrospinal fluids, sweat, urine, fecal matter, pancreatic fluid, trabecular fluid, 
cerebrospinal fluid, tears, bronchial lavage, swabbings, bronchial aspirants, semen, 
precervicular fluid, vaginal fluids, pre-ejaculate, etc. In a preferred embodiment, a sample 
collected is approximately 1 to 5 ml of blood. 

[0042] In some instances, samples may be collected from individuals over a longitudinal 
period of time (e.g., once a day, once a week, once a month, biannually or annually). 
Obtaining numerous samples from an individual over a period of time can be used to 
verify results from earlier detections and/or to identify an alteration in polypeptide pattern 
as a result of, for example, aging, drug treatment, pathology, etc. Samples can be obtained 
from humans or non-humans. In a preferred embodiment, samples are obtained from 
humans. 

[0043] Sample preparation and separation can involve any of the following procedures, 
depending on the type of sample collected and/or types of protein searched: removal of 
high abundance polypeptides (e.g., albumin, and transferrin); addition of preservatives and 
calibrants, desalting of samples; concentration of sample polypeptides; protein digestions; 
and fraction collection. Preferably, sample preparation techniques concentrate 
information-rich polypeptides (e.g., polypeptides that have "leaked" from diseased cells) 
and deplete polypeptides that would carry little or no information such as those that are 
highly abundant or native to serum. 

[0044] Sample preparation can take place in a manifold or preparation/separation device. 
In preferred embodiment, such preparation/separation device is a microfluidics device. 
Optimally, the preparation/separation device interfaces directly or indirectly with a 
detection device. In another embodiment, such preparation/separation device is a fluidics 
device. 

[0045] Approximately 100 }iL of a sample is analyzed per assay in some particular 
embodiments of the invention. Removal of undesired polypeptides (e.g., high abundance, 
uninformative, or undetectable polypeptides) can be achieved using high affinity reagents, 
high molecular weight filters, untracentrifiigation and/or electrodialysis. High affinity 
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reagents include antibodies that selectively bind to high abundance polypeptides or 
reagents that have a specific pH, ionic value, or detergent strength. High molecular weight 
filters include membranes that separate molecules on the basis of size and molecular 
weight. Such filters may further employ reverse osmosis, nanofiltration, ultrafiltration and 
microfiltration. 

[0046] Ultracentrifugation is another method for removing undesired polypeptides. 
Ultracentrifugation is the centrifugation of a sample at about 60,000 rpm while monitoring 
with an optical system the sedimentation (or lack thereof) of particles. Finally, 
electrodialysis is an electromembrane process in which ions are transported through ion 
permeable membranes from one solution to another under the influence of a potential 
gradient. Since the membranes used in electrodialysis have the ability to selectively 
transport ions having positive or negative charge and reject ions of the opposite charge, 
electrodialysis is useful for concentration, removal, or separation of electrolytes. 
[0047] In a preferred embodiment, the manifold or micro fluidics device performs 
electrodialysis to remove high molecular weight polypeptides or undesired polypeptides. 
Electrodialysis is first used to allow only molecules under approximately 30 kD (not a 
sharp cutoff) to pass through into a second chamber. A second membrane with a very 
small molecular weight (roughly 500 D) will allow smaller molecules to egress the second 
chamber. 

[0048] After samples are prepared, polypeptides of interest may be separated. Separation 
can take place in the same location as the preparation or in another location. In a preferred 
embodiment, separation occurs in the same micro fluidics device where preparation occurs, 
but in a different location on the device. Samples can be removed from an initial manifold 
location to a micro fluidics device using various means, including an electric field. In 
preferred embodiment, the samples are concentrated during their migration to the 
microfluidics device using reverse phase beads and an organic solvent elution such as 50% 
methanol. This elutes the molecules into a channel or a well on a separation device of a 
microfluidics device. 

[0049] Separation can involve any procedure known in the art, such as capillary 
electrophoresis (e.g., in capillary or on-chip) or chromatography (e.g., in capillary, column 
or on a chip). 

[0050] Electrophoresis is the separation of ionic molecules such as polypeptides by 
differential migration patterns through a gel based on the size and ionic charge of the 
molecules in an electric field. Electrophoresis can be conducted in a gel, capillary or on a 
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chip. Examples of gels used for electrophoresis include starch, acrylamide, agarose or 
combinations thereof. In a preferred embodiment, polyacrilamide gels are used. A gel can 
be modified by its cross-linking, addition of detergents, immobilization of enzymes or 
antibodies (affinity electrophoresis) or substrates (zymography) and pH gradient. 
Examples of capillaries used for electrophoresis include capillaries that interface with an 
electrospray. 

[0051] Capillary electrophoresis (CE) is preferred for separating complex hydrophilic 
molecules and highly charged solutes. Advantages of CE include its use of small samples 
(sizes ranging from 1 to 10 ul), fast separation, easily reproducible, and the ability to be 
coupled to a mass spectrometer. CE technology, in general, relates to separation 
techniques that use narrow bore fused-silica capillaries to separate a complex array of 
large and small molecules. High voltages are used to separate molecules based on 
differences in charge, size and hydrophobicity. Depending on the types of capillary and 
buffers used, CE can be further segmented into separation techniques such as capillary 
zone electrophoresis (CZE), capillary isoelectric focusing (CIEF) and capillary 
electrochromatography (CEC). 

[0052] Capillary zone electrophoresis (CZE), also known as free-solution CE (FSCE), is 
the simplest form of CE. The separation mechanism of CZE is based on differences in the 
charge-to-mass ratio of the analytes. Fundamental to CZE are homogeneity of the buffer 
solution and constant field strength throughout the length of the capillary. The separation 
relies principally on the pH-controlled dissociation of acidic groups on the solute or the 
protonation of basic functions on the solute. 

[0053] Capillary isoelectric focusing (CIEF) allows amphoteric molecules, such as 
polypeptides, to be separated by electrophoresis in a pH gradient generated between the 
cathode and anode. A solute will migrate to a point where its net charge is zero. At this 
isoelectric point (the solute's pi), migration stops and the sample is focused into a tight 
zone. In CIEF, once a solute has focused at its pi, the zone is mobilized past the detector 
by either pressure or chemical means. 

[0054] CEC is a hybrid technique between traditional liquid chromatography (HPLC) and 
CE. In essence, CE capillaries are packed with HPLC packing and a voltage is applied 
across the packed capillary, which generates an electro-osmotic flow (EOF). The EOF 
transports solutes along the capillary towards a detector. Both differential partitioning and 
electrophoretic migration of the solutes occurs during their transportation towards the 
detector, which leads to CEC separations. It is therefore possible to obtain unique 
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separation selectivities using CEC compared to both HPLC and CE. The beneficial flow 
profile of EOF reduces flow related band broadening and separation efficiencies of several 
hundred thousand plates per meter are often obtained in CEC. CEC also makes it is 
possible to use small-diameter packings and achieve very high efficiencies. 
[0055] Chromatography is another method for separating a subset of polypeptides. 
Chromatography is based on the differential absorption and elution of certain 
polypeptides. Liquid chromatography (LC), for example, involves the use of fluid carrier 
over a non-mobile phase. Conventional LC columns have an in inner diameter of roughly 
4.6 mm and a flow rate of roughly 1 ml/min. Micro-LC has an inner diameter of roughly 
1.0 mm and a flow rate of roughly 40 ul/min. Capillary LC utilizes a capillary with an 
inner diameter of roughly 300 im and a flow rate of approximately 5 ul/min. Nano-LC is 
available with an inner diameter of 50 um -1 mm and flow rates of 200 nl/min. Nano-LC 
can vary in length (e.g., 5, 15, or 25 cm) and have typical packing of CI 8, 5 um particle 
size. In a preferred embodiment, nano-LC is used. Nano-LC provides increased 
sensitivity due to lower dilution of chromatographic sample. The sensitivity of nano-LC 
as compared to HPLC is approximately 3700 fold. 

[0056] In preferred embodiments, the samples are separated on using capillary 
electrophoresis separation, more preferably CEC with sol-gels, or more preferably CZE. 
This will separate the molecules based on their eletrophoretic mobility at a given pH (or 
hydrophobicity in the case of CEC). 

[0057] In other preferred embodiments, the steps of sample preparation and separation are 
combined using microfluidics technology. A microfluidic device is a device that can 
transport liquids including various reagents such as analytes and elutions between different 
locations using microchannel structures. Microfluidic devices provide advantageous 
miniaturization, automation and integration of a large number of different types of 
analytical operations. For example, continuous flow microfluidic devices have been 
developed that perform serial assays on extremely large numbers of different chemical 
compounds. 

[0058] In a preferred embodiment, microfluidic devices are composed of plastic and 
formed by means of etching, machining, cutting, molding, casting or embossing. The 
microfluidics devices may alternatively be made from glass or silicon by means of 
etching, machining, or cutting. The microfluidic devices may be either single use for a 
single sample; multi-use for a single sample at a time with serial loading; single use with 
parallel multiple sample processing; multi-use with parallel multiple sample processing; or 
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a combination. Furthermore, more than one micro fluidics device may be integrated into 
the system and interface with a single detection device. 

[0059] Once prepared and separated, the polypeptides are automatically delivered to a 
detection device, which detects the polypeptides in a sample. In a preferred embodiment, 
polypeptides in elutions or solutions are delivered to a detection device by electrospray 
ionization (ESI). ESI operates by infusing a liquid containing the sample of interest 
through a channel or needle, which is kept at a potential (typically 3.5 kV). The voltage 
on the needle causes the spray to be charged as it is nebulized. The resultant droplets 
evaporate in a region maintained at a vacuum of several torr, until the solvent is essentially 
completely stripped off, leaving a charged ion. The charged ions are then detected by a 
detection device such as a mass spectrometer. In a more preferred embodiment, nanospray 
ionization (NSI) is used. Nanospray ionization is a miniaturized version of ESI and 
provides low detection limits using extremely limited volumes of sample fluid. 
[0060] In preferred embodiments, separated polypeptides are directed down a channel that 
leads to an electrospray ionization emitter, which is built into a micro fluidic device (an 
integrated ESI micro fluidic device). Preferably, such integrated ESI microfluidic device 
provides the detection device with samples at flow rates and complexity levels that are 
optimal for detection. Such flow rates are, preferably, approximately 50-200 uL/min. 
Furthermore, a microfluidic device is preferably aligned with a detection device for 
optimal sample capture. For example, using dynamic feedback circuitry, a microfluidic 
device may allow for control positioning of an electrospray voltage and for the entire 
spray to be captured by the detection device orifice. The microfluidic device can be sold 
separately or in combination with other reagents, software tools and/or devices. 
[0061] Calibrants can also be sprayed into detection device. Calibrants are used to set 
instrument parameters and for signal processing calibration purposes. Calibrants are 
preferably utilized before a real sample is assessed. Calibrants can interface with a 
detection device using the same or a separate interface as the samples. In a preferred 
embodiment, calibrants are sprayed into a detection device using a second interface (e.g., 
second spray tip). 

Polypeptide Detection 

[0062] Detection devices can comprise of any device that is able to detect polypeptide 
presence and/or level, including for example, NMR, 2-D PAGE technology, Western blot 
technology, immuoanalysis technology and mass spectrometry. In a preferred 
embodiment, the business model herein relies on a mass spectrometry to detect 
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polypeptides present in a given sample. There are various forms of mass spectrometers 
that may be utilized by the business method. 

[0063] In a preferred embodiment, the business method utilizes an ESI-MS detection 
device. An ESI-MS combines the novelty of ESI with mass spectrometry. Furthermore, 
an ESI-MS preferably utilizes a time-of-flight (TOF) mass spectrometry system. In TOF- 
MS, ions are generated by whatever ionization method is being employed and a voltage 
potential is applied. The potential extracts the ions from their source and accelerates them 
towards a detector. By measuring the time it takes the ions to travel a fixed distance, the 
mass of the ions can be calculated. TOF-MS can be set up to have an orthogonal- 
acceleration (OA). OA-TOF-MS are advantageous and preferred over conventional on- 
axis TOF because they have better spectral resolution and duty cycle. OA-TOF-MS also 
has the ability to obtain spectra at a relatively high speed. Brock et al. Anal. Chem (1998) 
70, 3735-41, discuss on-axis TOF known as Hadamard OA-TOF-MS. In addition to the 
MS systems disclosed above, other forms of EMI-MS include quadrupole mass 
spectrometry, ion trap mass spectrometry, and Fourier transform ion cyclotron resonance 
(FTICR-MS). 

[0064] Quadrupole mass spectrometry consists of four parallel metal rods arranged in four 
quadrants (one rod in each quadrant). Two opposite rods have a positive applied potential 
and the other two rods have a negative potential. The applied voltages affect the trajectory 
of the ions traveling down the flight path. Only ions of a certain mass-to-charge ratio pass 
through the quadrupole filter and all other ions are thrown out of their original path. A 
mass spectrum is obtained by monitoring the ions passing through the quadrupole filter as 
the voltages on the rods are varied. 

[0065] Ion trap mass spectrometry uses three electrodes to trap ions in a small volume. 
The mass analyzer consists of a ring electrode separating two hemispherical electrodes. A 
mass spectrum is obtained by changing the electrode voltages to eject the ions from the 
trap. The advantages of the ion-trap mass spectrometer include compact size, and the 
ability to trap and accumulate ions to increase the signal-to-noise ratio of a measurement 
[0066] FTICR mass spectrometry is a mass spectrometric technique that is based upon an 
ion's motion in a magnetic field. Once an ion is formed, it eventually finds itself in the cell 
of the instrument, which is situated in a homogenous region of a large magnet. The ions 
are constrained in the XY plane by the magnetic field and undergo a circular orbit. The 
mass of the ion can now be determined based on the cyclotron frequency of the ion in the 
cell. 

Attorney Docket N. 29 1 9 1 -707 17 
2283332_5.DOC 



[0067] In a preferred embodiment, the business model herein employs a TOF mass 
spectrometer, or more preferably, an ESI-TOF-MS, or more preferably an OA-TOF-MS, 
or more preferably a multiplexed OA-TOF-MS (a multiplexed TOF mass spectrometer), 
or more preferably a mass spectrometer having a dual ion funnel to support dynamic 
switching between multiple quadrapoles in series, the second of which can be used to 
dynamically filter ions by mass in real time. In preferred embodiments, the detection 
devices yields a spectrum of at least 150, more preferably 200, or more preferably 300 
spectrums per second. 

[0068] The detection device preferably interfaces with a separation/preparation device or 
microfluidic device, which allows for quick assaying of many of the polypeptides in a 
sample, or more preferably, most or all of the polypeptides in a sample. Preferably, a 
mass spectrometer is utilized that will accept a continuous sample stream for analysis and 
provide high sensitivity throughout the detection process (e.g., an ESI-MS). In another 
preferred embodiment, a mass spectrometer interfaces with one or more electrosprays, two 
or more electrosprays, three or more electrosprays or four or more electrosprays. Such 
electrosprays can originate from a single or multiple microfluidic devices. 
[0069] The detection system utilized preferably allows for the capture and measurement of 
most or all of the polypeptides are introduced into the detection device. It is preferable 
that one can observe polypeptides with high information-content that are only present at 
low concentrations. By contrast, it is preferable to remove those in advance that are, for 
example, common to all cells, especially those in high abundance. 

Signal Processing/Pattern Recognition 

[0070] The output from a detection device can then be processed, stored, and further 
analyzed or assayed using a bio-informatics system. A bio-informatics system can include 
one or more of the following: a computer; a plurality of computers connected to a 
network; a signal processing tool(s); a pattern recognition tool(s); and optionally a tool(s) 
to control flow rate for sample preparation, separation, and detection. 
[0071] Data processing utilizes mathematical foundations. Generally, dynamic 
programming is preferably used to align a separation axis with a standard separation 
profile. Furthermore, intensities may be normalized, preferably by fitting roughly 90% of 
the intensity values into a standard spectrum. The data sets are then fitted using wavelets 
that are specifically designed for separation and mass spectrometer data. Data processing 
preferably filters out some of the noise and reduces spectrum dimensionality. This allows 
the business to identify the more highly predictive patterns. 
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[0072] In some embodiments, data processing may also involve the calibration of a mass- 
axis using linear correction determined by the calibrants. Calibration can take prior to any 
sample detection; after sample detection; or in recurring intervals, for example, 
[0073] Following data processing, pattern recognition tools are utilized to identify subtle 
differences between phenotypic states. Pattern recognition tools are based on a 
combination of statistical and computer scientific approaches, which provide 
dimensionality reduction. Such tools are scalable. 

Examples 

[0074] The following prophetic example illustrates certain aspects of the invention. 
[0075] Approximately one to five ml of blood will be collected through venipuncture into 
special tubes that contain the appropriate calibrants/controls. Following thorough clot 
formation, serum will be isolated from sample following centrifugation. Serum sample 
will be aliquoted and frozen at -70C until analysis. On the order of 100 uL of thawed 
sample will be placed in a disposable plastic device that fits into a manifold, and hereafter, 
the entire process would be automated. The device will perform electrodialysis on the 
sample. Using an electric field and tangential flow, the sample will be passed through a 
membrane that allows only molecules under approximately 30kD (not a sharp cutoff) to 
pass through into a second chamber. Molecules of with the opposite charge or large 
molecules will not pass. A second membrane with a very low molecular weight cutoff 
(-500D) will allow small molecules to pass out of the second chamber. Molecules that 
remain in the second chamber will therefore be in a MW range (500D-30kD). Most of 
these molecules will be peptides, protein fragments and small proteins. Salts will have 
been removed, as will most of the abundant polypeptides, such as albumin. This process 
should take approximately 60 minutes. 

[0076] The molecules of interest (i.e. those that remain in the second chamber) will then 
be moved to another location on the disposable device, again using an electric field, and 
onto reverse phase beads for sample concentration. Using an organic solvent elution such 
as 50% methanol, the molecules will be eluted into a channel or well on a second 
disposable device, this time a micro fluidics chip. On this chip, a 1-5 minute capillary 
electrophoretic separation, CZE or CEC, will be run to separate the molecules on the basis 
of electrophoretic mobility at the given pH (or hydrophobicity in the case of CEC). 
Preferred separation peak widths under 1 second will be utilized. 

[0077] Separated molecules will be directed down a channel that leads to a electrospray 
ionization emitter that is built onto each chip. Expected flow rates are 50-200 uL/min. 
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Prior to starting the separation, the microfluidics device will be aligned with the mass 
spectrometer using dynamic feedback circuitry to optimally control positioning stage 
placement and electrospray voltage to establish a stable spray and, assuming appropriate nl 
flow rates, allow the entire spray to be captured in the mass spectrometer orifice. 
Standards/calibrants would also be sprayed into the mass spectrometer using a dedicated 
second spray tip and used to set instrument parameters and for signal processing 
calibration purposes before the real samples are run. 

[0078] An orthogonal multiplexed mass spectrometer captures the spray from the 
prepared/separated sample (given that it is separated, the molecules will be migrating in 
small groups) and yield a spectrum at a rate of 200 spectrum/s. The mass spectrometer 
incorporates a dual ion funnel to support dynamic switching between calibrants and 
analyte sprays to optimize instrument accuracy. The instrument contains multiple 
quadrapoles in series, the second of which can, in real time during a data acquisition run, 
be used to dynamically filter ions by mass, thus allowing increased dynamic range or 
focus on particular mass ranges of interest. The orthogonal Multiplexed implementation 
allows multiple ion packets to fly in the flight tube while at the same time decoupling 
mass accuracy from beam modulation rate, thus supporting high throughput, high 
sensitivity, and high mass resolution. 

[0079] A resulting data set from one sample would have on the order of 10 9 data points. 
Each data set would take approximately 5 minutes to collect, from start to finish. While a 
data set is being analyzed, a second sample could be run through the system to increase 
throughput. 

[0080] Each data set would have its mass axis calibrated through a linear correction 
determined by the calibrants run before the sample and by the calibrants run in parallel in 
the dual ion funnel. Then dynamic programming would be used to align the separations 
axis (using the TIC) to some standard separations profile. Intensities would then be 
normalized by fitting the 90% intensity values to a standard spectrum. 
[0081] These corrected data sets would then be fit using wavelets (or vaguelettes) that are 
specifically designed for separations/mass spectrometer data. The parameterized 
information about the spectrum would be soft thresholded and otherwise filtered to both 
remove noise and reduce dimensionality. 

[0082] During pattern discovery, a set of approximately 50 case and 50 controls of these 
filtered parameter sets would be entered into a pattern recognition tool such as a linear 
support vector machine, but probably multiple learning algorithms will be used on each 
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data set. The space of tunable parameters for the learning machine will be searched, and 
optimal patterns that distinguish the sample classes will be found, as would be error 
bounds on that prediction using cross-validation. 

[0083] During validation or in clinical assay, the filtered parameters from each new data 
set would be classified into a category by identifying which side of the decision boundary 
in the multidimensional parameter space that data set lies. Confidence intervals could 
also be calculated. This prediction and confidence interval would be reported back to the 
technician running the machine. In some embodiments the information about these 
clinical samples would be captured and those results and clinical outcomes of those 
patients in pattern recognition using more samples would be used, yielding better patterns 
to improve classification. 

[0084] Eventually, polypeptides/patterns that give rise to the most important data points 
for prediction could be identified using a tandem mass spectrometry approach. Once a 
pattern is discovered, separations will be optimized to increase the amount of information 
about the polypeptides of interest, by slowing down separations during the elution of those 
polypeptides and speeding it up elsewhere. This would allow for the use of a separate, 
efficient assay for every diagnostic developed. 

[0085] It is to be understood that the above embodiments are illustrative and not 
restrictive. The scope of the invention should be determined with respect to the scope of 
the appended claims, along with their full scope of equivalents. 
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